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We  investigate  ^efficient  implementations  of  Dijkstra’s  shortest  path  algo¬ 
rithm.  We  propose  a  new  data  structure,  called  the  redistributive  heap,  for  use  in 
this  algorithm.  On  a  network  with  n  vertices,  m  edges,  and  nonnegative  integer 
arc  costs  bounded  by  C,  a  one-level  form  of  redistributive  heap  gives  a  time 
bound  for  Dijkstra’s  algorithm  of  0(m  +  n  log  C).  A  two-level  form  of  redistri¬ 
butive  heap  gives  a  bound  of  0(m  +  n  log  Cl  log  log  C).  A  combination  of  a 
redistributive  heap  and  a  previously  known  data  structure  called  a  Fibonacci 
heap  gives  a  bound  of  0{m  +  n  Vlog  C?).  The  best  previously  known  bounds  are 
0(m  +  n  log  n )  using  Fibonacci  heaps  alone  and  0(m  log  log  C)  using  the  prior¬ 
ity  queue  structure  of  Van  Emde  Boas,  Kaas,  and  Zijlstra.  _ 
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1.  Introduction 

Lei  G  =  (V,£)  be  a  graph  with  vertex  set  V  of  size  n  and  arc  set  E  of  size  m.  Let  s  be  a  dis¬ 
tinguished  vertex  of  G  and  let  c  be  a  function  assigning  a  nonnegative  real-valued  cost  to  each  arc 
of  G.  We  denote  the  cost  of  (v,w)  e  E  by  r(v,w)  to  avoid  extra  parentheses.  The  single-source 
shortest  path  problem  is  that  of  computing,  for  each  vertex  v  reachable  from  s,  the  cost  of  a 
minimum-cost  path  from  s  to  v.  (The  cost  of  a  path  is  the  sum  of  the  costs  of  its  edges.)  We 
assume  that  all  vertices  are  reachable  from  s ;  if  this  is  not  the  case,  unreachable  vertices  can  be 
deleted  from  s  in  a  linear-time  preprocessing  step. 

The  theoretically  most  efficient  known  algorithm  for  this  problem  is  Dijkstra’s  algorithm 
[5].  Our  description  of  his  algorithm  is  based  on  that  in  Tarjan ’s  monograph  [  1 3].  The  algorithm 
maintains  a  tentative  cost  d(v)  for  each  vertex  v,  such  that  some  path  from  s  to  v  has  total  cost 
d(v).  As  the  algorithm  proceeds,  the  tentative  costs  decrease,  until  at  the  termination  of  the  algo¬ 
rithm,  for  each  vertex  v,  d(v)  is  the  cost  of  a  minimum-cost  path  from  s  to  v.  Initially  d(s)  =  0 
and  d(v)  =  «>  for  every  v  *  s.  The  algorithm  maintains  a  partition  of  the  vertices  into  three  states: 
unlabeled  vertices,  those  with  infinite  tentative  costs;  labeled  vertices,  those  with  finite  tentative 
cost  whose  minimum  cost  is  not  yet  known;  and  scanned  vertices,  those  whose  minimum  cost  is 
known.  Initially  s  is  labeled  and  all  other  vertices  are  unlabeled.  The  algorithm  consists  of 
repeating  the  following  step  until  all  vertices  are  scanned: 
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Scan  a  Vertex.  Select  a  labeled  vertex  v  such  that  d(v)  is  minimum  and  declare  v  scanned.  For 
each  arc  (v,w),  if  d(v)  +  c(v,w)  <  d(w),  replace  d( w)  by  d(v)  +  c(v,w)  and  declare  w  labeled  if  it 
is  currently  unlabeled. 

The  algorithm  can  easily  be  augmented  to  compute  actual  minimum-cost  paths  instead  of 
just  the  costs  of  such  paths.  This  computation  requires  only  O(m)  additional  time. 

The  key  to  efficient  implementation  of  Dijkstra’s  algorithm  is  the  use  of  a  data  structure 
called  a  heap  (or  priority  queue).  A  heap  consists  of  a  set  of  items,  each  with  an  associated  real¬ 
valued  key,  on  which  the  following  operations  are  possible. 

insert  ( h,x ):  Insert  new  item  x,  with  predefined  key,  into  heap  h. 

delete  min  ( h ):  Find  an  item  of  minimum  key  in  heap  h,  delete  it  from  h,  and  return  it  as  the 
result  of  the  operation. 

decrease  (h,x,  value):  Replace  by  value  the  key  of  item  x  in  heap  h\  value  must  be  smaller  than 
the  old  key  of  x. 

In  a  heap-based  implementation  of  Dijkstra’s  algorithm,  a  heap  h  contains  all  the  labeled 
vertices;  the  tentative  cost  of  a  labeled  vertex  is  its  key.  Initially  h  =  {5 }.  The  scanning  step  is 
implemented  as  follows: 

Scan  a  Vertex.  Let  v  =  delete  min  ( h ).  Declare  v  scanned.  For  each  arc  (v,h>),  if  d(w)  =  let 
d(w)  =  d(v)  +  c(v,,w)  and  perform  insert  (A,w);  if  d(w)  <  <*>  and  d(v)  +  c(v,w)  <  d(w)),  perform 
decrease  ( h,w,d(v )  +  c(v,w)). 

Dijkstra’s  algorithm  runs  in  0(m)  time  plus  the  time  required  to  perform  the  heap  opera¬ 
tions.  There  are  n  insert  operations  (counting  one  to  insert  s  initially),  n  delete  min  operations, 
and  at  most  m  -  n  +  1  decrease  operations.  Dijkstra’s  original  implementation  uses  an  array  to 
represent  the  heap,  giving  a  bound  of  0(1)  time  per  insert  or  decrease  and  O(n)  time  per  delete 
min,  or  0(n2)  time  overall.  A  more  modem  heap  implementation,  the  Fibonacci  heap  [7],  needs 
0(1)  time  per  insert  or  decrease  and  only  0(log  n)  per  delete  min,  for  an  overall  time  bound  of 
0(m  +  «  log  n).  The  same  bound  is  attainable  using  relaxed  heaps  [6]  or  Vheaps  [11], 

A  time  of  0(m  +  n  log  n)  is  best  possible  for  Dijkstra’s  algorithm,  if  the  arc  costs  are  real 
numbers  and  only  binary  comparisons  are  used  in  the  heap  implementation.  This  is  because  it  is 
easy  to  reduce  the  problem  of  sorting  n  numbers  to  a  run  of  Dijkstra’s  algorithm.  The  question 


arises  whether  the  0(m  +  n  log  n)  bound  can  be  beaten  in  the  special  case  that  all  the  arc  costs 
are  integers  of  moderate  size.  This  is  the  question  we  explore  in  this  paper. 

Henceforth  we  assume  that  all  arc  costs  are  integers  bounded  above  by  C.  Under  this 
assumption,  a  data  structure  of  Van  Emde  Boas,  Kaas,  and  Zijlstra  [15,16]  can  be  used  to  imple¬ 
ment  the  heap  in  Dijkstra’s  algorithm,  giving  a  time  bound  of  0(log  log  C)  per  heap  operation,  or 
0(m  log  log  C)  time  in  total.  The  space  needed  for  the  heap  is  0(n  +  C),  but  this  can  be  reduced 
to  0{n  +  Cc)  for  any  positive  constant  e  using  tries  [12],  or  even  to  0{n)  if  dynamic  perfect  hash¬ 
ing  is  used  [4].  (Use  of  dynamic  perfect  hashing  makes  the  algorithm  randomized  instead  of 
deterministic  and  the  time  bound  expected  instead  of  worst-case.) 

The  existence  of  an  0(m  +  n  log  n)  bound  for  arbitrary  real-valued  costs  suggests  the  prob¬ 
lem  of  obtaining  a  bound  for  integer  costs  of  the  form  0(m  +  nf(C ))  for  some  function  /  of  the 
number  sizes,  with  /  growing  as  slowly  as  possible.  An  algorithm  independently  discovered  by 
Dial  [3]  and  Johnson  [8]  runs  in  0(m  +  nC)  time.  Based  on  the  existence  of  the  Van  Emde 
Boas-Kaas-Zijlstra  data  structure,  one  might  hope  for  a  bound  of  0(m  +  n  log  log  C).  Obtaining 
such  a  bound  is  an  open  problem.  We  shall  develop  a  data  structure  that  results  in  a  bound  of 
0(m  +  n  Vlog C ).  Our  data  structure,  the  redistributive  heap,  exploits  special  properties  of  the 
heap  operations  in  Dijkstra’s  algorithm.  The  most  important  of  these  properties  is  that  successive 
delete  min  operations  return  vertices  in  nondecreasing  order  by  tentative  cost.  The  simplest  form 
of  the  data  structure,  the  one-level  redistributive  heap,  was  originally  proposed  by  Johnson  [9], 
who  used  it  to  obtain  an  0(m  log  log  C  +  n  log  C  log  log  C)  time  bound  for  Dijkstra’s  algorithm. 
By  slightly  changing  the  implementation,  we  reduce  the  time  to  0{m  +  n  log  C).  Section  2 
describes  this  result. 

By  adding  another  level  to  the  structure,  we  obtain  a  two-level  redistributive  heap.  The  idea 
of  adding  a  second  level  is  borrowed  from  Denardo  and  Fox  [2].  The  new  structure  reduces  the 
running  time  of  Dijkstra’s  algorithm  to  0(m  +  n  log  C  /  log  log  C).  Section  3  presents  this 
result.  One  more  change  to  the  structure,  the  addition  of  Fibonacci  heaps  in  the  second  level, 
reduces  the  time  bound  further,  to  0(m  +  n  Vlog  C ).  Section  4  discusses  this  improvement. 

Section  5  discusses  the  effect  of  increasing  the  cost  of  doing  arithmetic;  all  the  results  men¬ 
tioned  above  are  predicated  on  the  assumption  that  integers  of  size  0(nC)  can  be  added  or  com¬ 
pared  in  constant  time.  In  the  semilogarithmic  model  studied  in  Section  5,  in  which  arithmetic  on 
integers  of  Oflog  n)  bits  takes  0(1)  time,  the  m-term  in  the  bounds  given  above  increases  to 
0{m  log  C/log  n),  while  the  n-term  remains  the  same. 


2.  One-Level  Redistributive  Heaps 

Redistributive  heaps  rely  on  the  following  properties  of  Dijkstra’s  algorithm: 

(i)  For  any  vertex  v,  if  d(v)  is  finite,  d(v)  e  [0..nC]. 1 

(ii)  For  any  vertex  v  *s,  if  v  is  labeled,  d(v)  e  [d(x)..d(x)  +  C],  where  x  is  the  most  recently 
scanned  vertex. 

Property  (ii)  implies  in  particular  that  successive  delete  min  operations  return  vertices  in 
nondecreasing  order  by  tentative  cost. 

A  one-level  redistributive  heap  is  a  collection  of  B  =  T  lg  (C  +  1)]  +2  buckets, 2  indexed 
from  1  through  B.  Each  bucket  has  an  associated  size.  The  size  of  bucket  i  is  denoted  by  sized) 
and  defined  as  follows: 

size  (1)  =  1; 

size(i)  =  2‘-2  for  2<i<  B-l; 
size(B)-  nC  +  1. 

Observe  that  the  bucket  sizes  satisfy  the  following  important  inequality: 
y-i 

(1)  X  sized)  £  min  (size  (j),  C  +1 }  for  2  <  j  £  B. 

i  =  1 

Each  bucket  also  has  a  range  that  is  an  interval  of  integers.  Initially  the  ranges  of  the  buck¬ 
ets  partition  the  interval  [0..nC  +  1],  In  general  the  ranges  partition  the  interval  \dmm..nC  +  1], 
where  d^  is  the  maximum  label  of  a  scanned  node.  For  each  bucket  i  the  upper  bound  ud)  of 
its  interval  is  maintained;  the  range  of  bucket  i  is  range(i)  =  [u(i-l)  +  1 ..«(/)],  with  the  conven¬ 
tions  that  ufO^dnrin-l  and  ranged)  -0  if  ud- 1)  ^  ud)-  Whereas  the  sizes  of  all  buckets  are 
fixed  throughout  the  computation,  the  ranges  change;  for  each  bucket  i,  ud)  is  a  nondeercasing 
function  of  time. 

1  We  denote  the  interval  of  integers  (xl/<x£u)  by  [/..«J. 

2We  denote  log  2  by  lg. 


Initially  «(/)  =  2,_1  -1  for  1  <i< B- 1.  u(B)  =  nC  +  1.  Observe  that  this  implies  I  range( i)  I 
£  sized).  This  inequality  is  maintained  throughout  the  computation  for  each  bucket.  The  labeled 
vertices  are  stored  in  the  buckets,  with  vertex  v  stored  in  bucket  i  if  d(v)  e  ranged).  Initially,  ver¬ 
tex  s  is  inserted  into  bucket  1.  The  range  of  bucket  1  is  maintained  so  that  every  vertex  v  in 
bucket  1  has  d(v)  =  u(l);  thus  the  effective  range  of  bucket  1  contains  only  «(1). 

Each  bucket  is  represented  by  a  doubly-linked  list  of  its  vertices,  to  make  insertion  and 
deletion  possible  in  constant  time.  In  addition,  stored  with  each  vertex  is  the  index  of  the  bucket 
containing  it. 

The  three  heap  operations  are  implemented  as  follows.  To  insert  a  newly  labeled  vertex  v, 
examine  values  of  i  in  decreasing  order,  starting  with  i  =  B,  until  finding  the  largest  i  such  that 
ud)  <  d(y)\  then  insert  v  into  bucket  i  +  1.  To  decrease  the  key  of  a  vertex  v,  remove  v  from  its 
current  bucket,  say  bucket  j.  Reduce  the  key  of  v.  Proceed  as  in  an  insertion  to  reinsert  v  into  the 
correct  bucket,  but  begin  with  bucket  i  =  j. 

For  a  single  vertex  v,  the  time  spent  on  insertion  and  all  decrease  operations  is  0(log  C) 
plus  0(  1)  per  decrease,  because  the  index  of  the  bucket  containing  v  can  never  increase.  Thus 
the  total  time  for  all  such  operations  during  a  run  of  Dijkstra’s  algorithm  is  0(m  +  n  logC). 

The  most  complicated  operation  is  delete  min,  which  is  performed  as  follows.  If  bucket  1  is 
nonempty,  return  any  vertex  in  bucket  1 .  Otherwise,  find  the  nonempty  bucket  of  smallest  index, 
say  bucket  j.  By  scanning  through  the  items  in  bucket  j,  find  a  vertex  of  smallest  tentative  cost, 
say  v.  Save  v  to  return  as  the  result  of  the  delete  min  and  distribute  the  remaining  vertices  in 
bucket  i  among  buckets  of  smaller  index,  as  follows.  Replace  n(0)  by  d(v)-l,  u(l)  by  d(v),  and 
for  i  running  from  2  through  j- 1,  replace  ud)  by  min  {k(i'-1)  +  sized),  «(/))•  Remove  each  ver¬ 
tex  from  bucket  j  and  reinsert  it  as  in  decrease ;  do  not  reinsert  v. 

Inequality  (1)  guarantees  that,  if  "Z  2  in  a  delete  min,  every  vertex  in  bucket  j  will  move  to 
a  bucket  of  strictly  smaller  index.  It  follows  that  the  time  spent  on  delete  min  operations  is 
0(log  C)  per  delete  min  plus  0(log  C)  per  vertex,  for  a  total  of  0(n  log  C)  during  a  run  of 
Dijkstra’s  algorithm.  We  conclude  that  the  total  running  time  of  Dijkstra’s  algorithm  with  this 
implementation  is  0(m  +  n  log  C).  The  space  required  is  0(m  +  log  C).  Johnson,  using  the 
same  data  structure,  obtained  a  bound  worse  by  a  factor  of  log  log  C,  because  he  used  binary 
search  instead  of  sequential  scan  to  reinsert  vertices  into  buckets. 

3.  Two-Level  Redistributive  Heaps 

Reducing  the  running  time  of  the  algorithm  of  Section  2  requires  reducing  the  number  of 
reinsertions  of  vertices  into  buckets.  This  can  be  done  by  increasing  the  bucket  sizes,  but  then 
inequality  (1)  no  longer  holds.  We  overcome  this  problem  by  dividing  each  bucket  into 
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segments.  All  segments  within  a  bucket  have  the  same  size. 

A  two-level  redistributive  heap  is  defined  by  a  parameter  K,  determining  the  number  of  seg¬ 
ments  within  a  bucket.  The  number  of  buckets  is  B  =["log*(C  +  1)]  +1.  The  sizes  of  the  buck¬ 
ets  are  as  follows: 

sized)  =  K‘  for  1  < i  SB-1; 
size(B)  =  nC  +  1. 

As  in  the  one-level  scheme,  bucket  i  has  ranged)  =  [«(/— 1)  +  1 ..«(/)],  with  u(0)  =  </min-l 
and  u(B)  =  nC  +  1.  The  remaining  upper  bounds  on  ranges  have  the  following  initial  values: 

u(j)  =  £  -1  for  1  <  j  <  B- 1.  ! 

i  =  i  ! 

For  1  Si  SB-1,  bucket  i  is  partitioned  into  K  segments,  each  of  size  K‘~x .  Segments  arc 
indexed  by  ordered  pairs;  segment  d,k)  is  segment  k  of  bucket  i.  Bucket  B  consists  of  a  single 
segment. 

Each  segment  has  an  associated  range,  which  is  a  function  of  the  range  of  its  bucket.  The  < 

range  of  segment  (t.Jfc)  is  ranged,k)  =  [«(i,ifc-l)  +  1. .«(/,*)]  where  ud.k)  is  defined  as  follows:  ; 

*1 

» 

ud,k)  =  max  {w(i-l),  ud)~  (K-k)K‘~x }.  \ 

Observe  that  I  range(i,k)  I  <  K‘~x  =  sized, k)  for  1  <  i  <  B-l  and  1  <k<K.  The  algorithm  •! 

maintains  the  invariant  that  for  1  <  i  <  B-l,  I  ranged)  \  ) 

The  algorithm  maintains  the  ranges  of  buckets  (i.e.  the  h(i)’s)  explicitly,  but  computes  the 
ranges  of  segments  as  needed.  Observe  that,  given  an  integer  x  e  ranged),  the  value  of  k  such 
that  x  €  ranged.k)  can  be  computed  in  constant  time  using  the  formula 


k  =  K-l(udyx)  /  K1-']  . 


Choosing  K  to  be  a  power  of  two  simplifies  this  computation  on  a  computer  whose  number 
representation  is  binary,  but  this  is  not  necessary  for  our  theoretical  results. 


The  labeled  vertices  are  stored  in  the  segments,  with  vertex  v  stored  in  segment  d,k)  if 
d(v)e  range(i,k).  Each  segment  is  represented  by  a  doubly-linked  list.  The  three  heap 
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operations  are  implemented  as  follows.  An  insert  or  decrease  operation  on  a  vertex  v  is  per¬ 
formed  as  in  a  one-level  heap,  except  that  once  the  bucket  i  such  that  d(v)  e  range(i)  is  located, 
the  k  such  that  d(v)  e  range(i,k)  is  computed  (in  constant  time),  and  v  is  inserted  into  segment 
(i,k).  The  total  time  for  all  insert  and  decrease  operations  during  a  run  of  Dijkstra’s  algorithm  is 
0(m  +  Bn)  -  0(m  +  n  log*  C). 

The  delete  min  operation  is  implemented  much  as  in  a  one-level  heap,  except  that  only  the 
contents  of  a  single  segment  are  distributed,  not  the  contents  of  an  entire  bucket.  To  perform 
delete  min,  find  the  first  non-empty  bucket,  say  j.  Find  the  first  nonempty  segment  within  bucket 
j,  say  O',*)-  (If  j  =  B,  k  =  1,  since  bucket  B  consists  of  only  a  single  segment.)  If  j  =  1,  remove 
and  return  any  vertex  in  segment  (J,k).  Otherwise,  scan  the  vertices  in  segment  (J,k)  to  find  one, 
say  v,  with  minimum  tentative  cost.  Redefine  u(i )  for  0  <  i  <  j- 1  as  in  a  one-level  heap.  Distri¬ 
bute  all  vertices  in  segment  (J,k)  (except  v)  into  their  new  correct  segments,  which  lie  in  buckets 
1  through  j- 1. 

A  few  details  of  the  data  structure  deserve  comment.  To  facilitate  locating  the  first 
nonempty  bucket,  a  bit  for  each  bucket  is  maintained  that  indicates  whether  or  not  the  bucket  is 
empty.  Determining  j  in  a  delete  min  then  takes  0(B)  time.  The  segments  are  represented  as  an 
array  of  doubly-linked  lists,  with  the  index  of  segment  (j.jfe)  being  K(i-l)  +  k.  Since  each  vertex 
in  a  segment  that  is  distributed  moves  to  a  lower  bucket,  the  total  number  of  such  movements  is 
O(Bn).  The  total  time  for  all  the  delete  min  operations  is  0(Bn)  plus  the  lime  for  n  steps  of  the 
form,  “find  the  first  nonempty  segment  in  a  given  bucket.” 

If  each  such  segment  is  found  merely  by  scanning  all  the  segments  in  the  bucket,  the  time 
for  one  such  step  is  0(K),  and  the  total  running  time  of  Dijkstra’s  algorithm  is  0(m  +  (B  +  K)n). 
Choosing  K  proportional  to  log  C  /  log  log  C  gives 

B  =ri°gtf  (C  +  1)1  +  1  =  0(logC  /  log  log  C),  and  the  total  running  time  is 
0(m  +  n  log  C  /  log  log  C).  The  space  required  is  0(m  +  (log  C  /  log  log  C)2),  reducible  to 
0(m  +  (log  C  /  log  log  Cf  )  for  any  constant  e  >  0  using  a  trie  [12]  or  even  to  0(m)  using  ran¬ 
domization  and  dynamic  perfect  hashing  [4], 

If  C  <  n,  the  running  time  of  the  algorithm  can  be  reduced  to  0(m  +  n  log  C  /  log  log  n)  by 
using  table  lookup  to  find  nonempty  segments.  Specifically,  choose  K  =Tlg  n]  .  For  each  bucket 
(other  than  bucket  B),  maintain  an  integer  of  [lg  n)  bits  whose  k,h  bit  is  one  if  segment  k  of  the 
bucket  is  nonempty  and  zero  otherwise.  During  a  preprocessing  step,  construct  an  array  of  n 
positions,  indexed  from  1  to  n,  such  that  position  i  contains  k  if  and  only  if  the  k'M  bit  of  i 
(expressed  in  binary)  is  the  first  nonzero  bit.  Construction  of  this  array  takes  O(n)  time,  and  once 
the  array  is  constructed,  the  first  nonempty  segment  of  a  nonempty  bucket  can  be  found  in  0(  1) 
time  by  accessing  the  array  position  indexed  by  the  integer  encoding  the  nonempty  segments. 
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By  choosing  the  appropriate  one  of  the  two  methods  above  for  finding  the  first  nonempty 
segment  in  a  bucket,  we  obtain  a  time  bound  of  0(m  +  n  log  C/  log  log  (nC)>  for  Dijkstra’s  algo¬ 
rithm. 

4.  Use  of  Fibonacci  Heaps 

Our  final  improvement  reduces  the  running  time  of  Dijkstra’s  algorithm  to 
0(m  +  n  Vlog  C )  by  using  a  variant  of  Fibonacci  heaps  to  find  nonempty  segments.  Throughout 
this  section  we  shall  refer  to  each  segment  by  its  index;  as  in  Section  3,  the  index  of  segment  (i,k) 
is  K  (i— 1)  +  k,  which  is  an  integer  in  the  interval  [I..KB-K  +  1].  We  associate  with  each  labeled 
vertex  the  index  of  the  segment  containing  it.  We  need  to  be  able  to  maintain  the  collection  of 
labeled  vertices  under  the  following  three  kinds  of  operations: 

delete  min:  Find  a  labeled  vertex  of  minimum  index,  mark  it  scanned,  and  return  it. 

insert(x ):  Declare  x  to  be  a  newly  labeled  vertex,  with  predefined  index. 

decrease{x,value)\  Replace  the  index  of  labeled  vertex  x  by  value;  value  must  be  smaller  than  the 
old  index  of  x. 

In  other  words,  we  must  maintain  the  set  of  labeled  vertices  as  a  heap,  with  the  key  of  each 
vertex  equal  to  its  index.  A  run  of  Dijkstra’s  algorithm  requires  n  insert  operations,  n  delete  min 
operations,  and  at  most  m  decrease  operations,  in  addition  to  the  time  for  maintaining  bucket 
boundaries  and  recomputing  segments.  The  total  time  for  all  the  latter  bookkeeping  is 
0(m  +  Bn). 

Fibonacci  heaps  (abbreviated  F-heaps)  support  delete  min  in  0(log  n)  amortized  time  1  and 
insert  and  decrease  in  0(1)  amortized  time  [7],  where  n  is  the  maximum  heap  size.  But  in  our 
application,  the  number  of  possible  index  values  is  much  smaller  than  the  number  of  vertices. 
We  shall  describe  how  to  extend  Fibonacci  heaps  so  that  if  the  keys  are  integers  in  the  interval 
[1JV],  the  amortized  time  per  delete  min  is  OOog  min  { n,N }),  while  the  amortized  time  per 
insert  or  decrease  remains  0(1).  In  the  application  at  hand,  we  can  take  N  =  KB.  The  choice  of 
K  =  $ gives  B  =  OOog*  C)  =  0(Vlog  C ),  and  log  N  =  0(V log  C );  therefore  the  total  run¬ 
ning  time  of  Dijkstra’s  algorithm  is  0(m  +  n  Vlog  C). 

1  By  amortized  time  we  mean  the  time  per  operation  averaged  over  a  worst-case  sequence  of  operations.  See 
Taijan’s  survey  paper  [14]  or  Mehlhom’s  book  [10]. 
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It  remains  for  us  to  make  the  necessary  changes  to  F-heaps.  The  main  idea  is  to  make  sure 
that  such  a  heap  contains  at  most  N  items,  i.e.,  at  most  one  item  per  key  value.  Making  this  idea 
work  in  the  presence  of  decrease  operations  requires  some  care  and  some  knowledge  of  the  inter¬ 
nal  workings  of  F-heaps. 

We  need  to  know  the  following  facts  about  F-heaps.  An  F-heap  consists  of  a  collection  of 
heap-ordered  trees  whose  nodes  are  the  items  in  the  heap.  (A  heap-ordered  tree  is  a  rooted  tree 
such  that  if  p(x)  is  the  parent  of  node  x,  the  key  of  x  is  no  greater  than  the  key  of  p(x).)  Each 
node  in  an  F-heap  has  a  rank  equal  to  the  number  of  its  children.  A  fundamental  operation  on  F- 
heaps  is  linking,  which  combines  two  heap  ordered  trees  into  one  by  comparing  the  keys  of  their 
roots  and  making  the  root  of  smaller  key  the  parent  of  the  root  of  larger  key,  breaking  a  tie  arbi¬ 
trarily.  A  link  operation  takes  <9(1 )  time.  Only  trees  with  roots  of  equal  rank  are  linked. 

Each  nonroot  node  in  a  F-heap  is  in  one  of  two  states,  marked  or  unmarked.  When  a  node 
becomes  a  nonroot  by  losing  a  comparison  during  a  link,  it  becomes  unmarked.  Nodes  become 
marked  during  decrease  operations,  as  described  below. 

The  three  heap  operations  are  performed  as  follows.  To  insert  a  new  item,  merely  make  it 
into  a  one-node  tree  and  add  this  tree  to  the  collection  of  trees.  This  takes  0(1)  time. 

To  perform  a  decrease  operation  on  a  node  x,  begin  by  updating  the  key  of  x.  Then,  if  x  is 
not  a  root,  cut  the  edge  joining  x  and  p(x)  and  repeat  the  following  step,  with  y  initially  equal  to 
the  old  p(x),  until  y  is  unmarked  or  y  is  a  tree  root;  cut  the  edge  joining  y  and  its  parent  p(y),  and 
replace  y  by  the  old  p(y).  After  the  last  such  cut,  if  the  last  node  y  is  not  a  root,  mark  it. 

The  overall  effect  of  such  a  decrease  operation  is  to  possibly  break  the  initial  tree  contain¬ 
ing  x  into  several  trees,  one  of  which  has  root  x.  The  time  required  by  the  decrease  operation  is 
0(  1)  plus  0(1)  per  cut.  Since  only  one  node  is  marked  per  cut,  and  since  one  node  becomes 
unmarked  per  cut  except  for  at  most  one  cut  per  decrease  operation,  the  total  number  of  cuts  dur¬ 
ing  a  sequence  of  decrease  operations  is  at  most  twice  the  number  of  decrease  operations,  even 
though  a  single  decrease  can  result  in  many  cuts. 

To  perform  the  third  heap  operation,  delete  min,  scan  all  the  tree  roots  and  identify  one,  say 
x,  of  minimum  key.  Remove  x  from  its  tree,  thereby  making  each  of  its  children  a  tree  root. 
Finally,  repeatedly  link  trees  whose  roots  have  equal  rank,  until  no  two  tree  roots  have  equal 
rank. 

The  key  to  the  analysis  of  F-heaps  is  that  manipulation  of  rooted  trees  in  the  ways  described 
above  maintains  the  following  invariant:  for  any  node  x,  rank(x )  =  <9 Gog  size(x)),  where  size(x)  is 
the  number  of  nodes  in  the  subtree  rooted  at  x.  A  simple  analysis  gives  an  amortized  time  bound 
of  (9(1)  for  insert  and  decrease,  and  <9(log  n)  for  delete  min. 
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Now  we  extend  F-heaps  to  reduce  the  amortized  time  per  delete  min  to  <9  Gog  min  [n,N]). 
For  each  value  i  e  [l.JV],  the  algorithm  maintains  the  set  S(i)  of  items  with  key  i.  One  item  in 
S(i)  is  designated  the  representative  of  S(i).  All  the  items,  both  the  representatives  and  the  non¬ 
representatives,  are  grouped  into  heap-ordered  trees  of  the  kind  manipulated  by  the  F-heap  algo¬ 
rithm.  These  trees  are  divided  into  two  groups:  active  trees,  those  whose  roots  are  representa¬ 
tives,  and  passive  trees,  those  whose  roots  are  nonrepresentatives. 

The  algorithm  maintains  the  following  two  invariants: 

(i)  The  key  of  a  nonroot  node  x  is  strictly  less  than  that  of  its  parent  (a  strengthening  of  the 
heap  order  property): 

(ii)  Every  nonrepresentative  is  a  root. 

Invariant  (ii)  implies  that  all  nodes  in  active  trees  are  representatives  and  hence  have  dis¬ 
tinct  keys;  thus  the  number  of  nodes  in  active  trees  is  at  most  N.  Invariants  (i)  and  (ii)  together 
imply  that  the  representative  of  minimum  key  is  the  root  of  an  active  tree;  hence  delete  min  need 
only  scan  the  roots  of  active  trees. 

The  three  heap  operations  are  performed  as  follows.  To  insert  an  item  x,  make  it  into  a  one 
node  tree,  which  becomes  active  or  passive  depending  on  whether  the  set  S(i)  into  which  x  is 
inserted  is  empty  or  not;  if  it  is,  x  becomes  the  representative  of  S(i).  To  perform  a  decrease 
operation,  proceed  as  on  an  ordinary  F-heap  as  described  above,  with  the  following  addition: 
move  x  from  its  old  set,  say  S(i),  to  the  appropriate  new  set,  say  S(J).  Make  some  other  item  (if 
any)  in  S(i ),  say  y,  the  representative  of  S(i)  and  make  the  tree  with  root  y  active.  If  x  is  the  only 
item  in  S(J),  make  the  tree  rooted  at  x  active;  otherwise,  make  it  passive.  Make  other  new  trees 
created  by  cuts  active.  The  total  time  required  by  a  decrease  operation  is  (9(1)  plus  (9(1)  per  cut, 
including  the  time  to  move  trees  between  the  active  and  passive  groups. 

To  perform  delete  min,  proceed  as  on  an  ordinary  F-heap,  with  the  following  changes:  scan 
only  the  roots  of  active  trees  to  find  a  minimum,  and  perform  repeated  linking  only  on  active 
trees;  that  is,  after  deleting  the  active  node  of  minimum  key,  repeatedly  link  active  trees  whose 
roots  have  equal  rank  until  all  active  trees  have  roots  of  different  ranks. 

The  efficiency  analysis  of  the  extended  data  structure  is  almost  the  same  as  that  of  the  origi¬ 
nal.  Define  the  potential  of  the  data  structure  to  be  the  number  of  trees  plus  twice  the  number  of 
marked  nodes.  Define  the  amortized  time  of  a  heap  operation  to  be  its  actual  time  (measured  in 
suitable  units)  plus  the  net  increase  in  potential  it  causes.  The  initial  potential  is  zero  (if  the  ini¬ 
tial  heap  is  empty)  and  the  potential  is  always  nonnegative.  It  follows  that,  for  any  sequence  of 
heap  operations,  the  total  amortized  time  is  an  upper  bound  on  the  total  actual  time. 
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The  amortized  time  of  an  insertion  is  0(1),  since  it  increases  the  potential  by  one.  A 
decrease  operation  causing  k  cuts  adds  0(1)-*  to  the  potential:  each  cut  except  for  at  most  one 
adds  a  tree  but  removes  a  marked  node;  marked  nodes  count  for  two  in  the  potential.  Thus  a 
decrease  takes  0(1)  amortized  time  if  a  cut  is  regarded  as  taking  unit  time. 

Each  link  during  a  delete  min  operation  reduces  the  potential  by  one  and  thus  has  an  amor¬ 
tized  time  of  zero,  if  a  link  is  regarded  as  taking  unit  time.  Not  counting  links,  the  time  spent 
during  a  delete  min  is  0(log  min  [n,N]),  as  is  the  increase  in  potential  caused  by  removing  a 
node  of  minimum  key:  the  maximum  rank  of  any  node  is  0(log  min  {n,N}).  Thus  the  amortized 
time  of  delete  min  is  0(log  min  {n,N}),  as  desired. 

The  idea  used  here,  that  of  grouping  trees  into  active  and  passive,  applies  as  well  to  Vheaps 
(11]  to  give  the  same  time  bounds,  but  it  does  not  seem  to  apply  to  relaxed  heaps  [6].  The 
extended  F-heap,  if  used  directly  in  the  implementation  of  Dijkstra’s  algorithm,  gives  a  running 
time  of  0(m  +  n  log  C),  the  same  as  that  obtained  in  Section  2. 

5.  Time  Bounds  in  a  Semilogarithmic  Computation  Model 

In  the  previous  sections,  we  analyzed  our  algorithms  using  a  unit-cost  random-access 
machine  [1]  as  the  computation  model.  In  particular,  we  assumed  that  addition  and  comparison 
of  integers  in  the  interval  [0..nC]  takes  0(1)  time.  If  C  is  large,  this  assumption  may  not  be  real¬ 
istic.  In  this  section,  we  derive  bounds  for  the  algorithms  assuming  a  semilogarithmic  cost  com¬ 
putation  model.  We  show  that  in  this  model,  the  m-term  in  our  bounds  becomes  mflgC/lgn]  , 
while  the  /i-term  remains  unchanged.  The  following  two  assumptions  define  the  semilogarithmic 
model: 

(1)  Arithmetic  on  integers  of  length  0(log  n)  and  all  other  random  access  machine  operations 
(index  calculations,  pointer  assignments,  etc.)  take  0(1)  time; 

(2)  log  C  »  n0(,). 

In  our  algorithms,  we  represent  arc  costs  and  tentative  costs  d(v)  as  arrays  of  length 
I” lg  (nC  +  1)  /  F]  ,  where  F  =[lg  n  /  2]  .  Each  array  element  is  an  integer  in  the  range  (0..2*-l). 
By  assumption  (1),  indexing  into  these  arrays  takes  0(1)  time,  but  this  is  only  reasonable  if  the 
indices  are  0Qog  n)  in  length,  which  is  the  reason  for  imposing  assumption  (2).  Henceforth  in 
this  section  we  also  assume  that  log  C  =  Q  (log  a);  if  log  C  =  0(log  n),  the  bounds  of  the  previ¬ 
ous  sections  hold  without  change  for  the  semilogarithmic  model.  If  log  C  =  CJ  (log  n),  then 
lg(nC  + 1)  =  0(log  C),  a  fact  that  we  shall  use  repeatedly  without  further  comment. 
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Let  us  first  analyze  the  algorithm  of  Section  2.  We  shall  revise  and  reformulate  the  algo¬ 
rithm  to  fit  into  the  semilogarithmic  model  better.  In  particular,  we  emphasize  the  bit  manipula¬ 
tion  involved  in  the  computation.  Let  B  =  fig  (rtC  +  1)1  .  At  a  given  time  in  the  algorithm,  let  V 

be  a  labeled  vertex  with  minimum  tentative  cost  d(v).  Let  ag_i  ...ai  be  the  binary  representation 

a-i  . 

of  d(v),  i.e.  a,  e  (0,1)  and  d(v)=  £  a,  2'  .  The  algorithm  maintains  buckets  numbered  1 

i-t 

through  B  containing  the  labeled  vertices,  with  bucket  1  containing  every  vertex  u  such  that 
d(u)  =  d(v)  and  bucket  i,  for  2  <i£B,  containing  every  vertex  u  such  that  position  i-1  is  the 
largest  position  in  which  the  binary  representations  of  d(u)  and  d(v)  differ.  Note  that  in  this  vari¬ 
ant  of  the  Section  2  algorithm,  bucket  ranges  are  represented  implicitly  rather  than  explicitly. 

Finding  the  smallest  nonempty  bucket  by  a  sequential  scan  over  the  buckets  takes 
0(log  (nC  +  1))  =  O  (log  C)  time.  Distributing  the  vertices  in  a  bucket  is  done  by  scanning  down 
through  the  appropriate  bits  of  the  tentative  costs  of  the  vertices  in  the  bucket.  Such  distribution 
takes  £)(log  (nC  +  \))~0 (log  C)  time  per  vertex  over  the  entire  algorithm.  (Extracting  the 
appropriate  bit  of  a  tentative  cost  can  be  done  either  by  appropriate  shifting  and  masking  opera¬ 
tions,  or,  if  these  are  not  available,  by  table  lookup.  In  either  case,  the  time  to  extract  a  bit  is 
0(1).)  Updating  tentative  costs  takes  0(m  log  C  /  log  n )  time  over  the  entire  algorithm.  It  fol¬ 
lows  that  the  total  running  time  of  Dijkstra's  algorithm  is  0(m  log  C  /  log  n  +  n  log  C). 

Next  we  turn  to  the  algorithms  of  Sections  3  and  4.  In  these  algorithms  each  bucket  is 
divided  into  K  segments.  In  the  spirit  of  assumption  (1),  we  restrict  ourselves  to 
log  K  =  0(log  n).  The  number  of  buckets  is  B  =  f  log*-  (nC  +  1)]  .  The  assignment  of  labeled 
vertices  to  buckets  and  segments  is  as  follows.  Let  (Xs-i  -Cic  be  the  A"- ary  expansion  of  the 
minimum  tentative  cost  of  a  labeled  vertex,  say  v.  A  labeled  vertex  u  belongs  to  segment  k  of 
bucket  i  if  either  i  =  1,  k  =  oto  +  1,  and  d(u)  =  d(v);  or  if  i-l  is  the  largest  position  at  which  the 
Af-ary  expansions  of  d(u)  and  d(v)  differ  and  k  =  o,-_,  +  1. 

The  time  to  find  the  first  nonempty  segment  (by  scanning  over  buckets,  then  over  segments 
within  a  bucket)  is  0(B  +K).  The  total  time  for  distributing  vertices  among  segments  is  0(B) 
per  vertex.  The  total  running  time  of  the  method  is  thus  0(m  log  C  /  log  n  +  nB  +  nK).  Choos¬ 
ing  K  proportional  to  lg(nC)  /  lglg(nC)  gives  a  total  running  time  of 
0(m  log  C  /  log  n  +  n  log  ( n  C)  /  log  log  ( n  C))  =  0(m  log  C  /  log  n  +  n  log  C  /  log  log  C). 

Adding  an  extended  F-heap  to  represent  the  nonempty  segments,  as  in  Section  4,  reduces 
the  time  to  find  the  first  nonempty  segment  to  0(Qog(BK))2  /  log  n):  there  are  0(log  (BK)) 
steps,  each  of  which  manipulates  integers  in  the  interval  [LiMT),  which  is  the  range  of  the  seg¬ 
ment  indices.)  The  m  decrease  operations  on  the  F-heap  take  0(m  log  (BK)  /  log  n)  time.  The 
total  time  to  run  Dijkstra’s  algorithm  is  thus 

0(m  (log  C  /  log  n  +  log  (BK)  I  logit)  +  n(S  +  (log  (BK))1  /  log  n)).  Choosing  K  =  1a'  ,I?il 


if  lg(flC)  S  (lgn)2,  K  =  n  if  lg(nC)  >  (lgn)2  gives  a  total  running  time  of 
0(m  log  C  /  log  n  +n  Vlog  C ). 

It  is  worthwhile  to  compare  these  bounds  with  the  running  time  of  the  straight  F-heap-based 
algorithm  [7].  That  algorithm  requires  0(m  +  n  log  n)  steps,  each  of  which  involves  addition  or 
comparison  of  integers  in  the  range  [0..nC]  and  thus  takes  0(log  C  /  log  n)  time  in  the  semiloga- 
rithmic  model.  The  total  time  of  the  algorithm  is  thus  0(m  log  C  /  log  n  +  n  log  C),  the  same  as 
the  time  for  the  modified  Section  2  algorithm.  The  F-heap  algorithm  is  more  complicated,  how¬ 
ever.  The  algorithms  of  Sections  3  and  4  are  both  faster  than  the  F-heap  algorithm,  for  appropri¬ 
ate  values  of  the  parameters.  Note  that  the  time  to  read  the  problem  input  is  £2  (m  log  C  /  log  n). 
Assuming  that  solving  the  problem  requires  reading  the  input,  the  algorithm  that  combines  a 
two-level  distributive  heap  with  an  F-heap  is  optimum  to  within  a  constant  factor  if 
log  C  =  £2  ((log  n)2),  i.e.,  C  =  nn(lo*"),  or  ifm  =  £2  (n  log  n  /  Vlog  C ). 
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